A Diachronic Approach for Schwa Deletion in Indo Aryan Languages
نویسندگان
چکیده
Schwa deletion is an important issue in grapheme-to-phoneme conversion for IndoAryan languages (IAL). In this paper, we describe a syllable minimization based algorithm for dealing with this that outperforms the existing methods in terms of efficiency and accuracy. The algorithm is motivated by the fact that deletion of schwa is a diachronic and sociolinguistic phenomenon that facilitates faster communication through syllable economy. The contribution of the paper is not just a better algorithm for schwa deletion; rather we describe here a constrained optimization based framework that can partly model the evolution of languages, and hence, can be used for solving many problems in computational linguistics that call for diachronic explanations.
منابع مشابه
Aspect shifts in Indo-Aryan
The grammaticalization literature notes the cross-linguistic robustness of a diachronic pattern involving the aspectual categories resultative, perfect, and perfective. Resultative aspect markers often develop into perfect markers, which then end up as perfect plus perfective markers. We introduce supporting data from the history of Old and Middle Indo-Aryan languages, whose instantiation of th...
متن کاملAspect shifts in Indo-Aryan and trajectories of semantic change1
The grammaticalization literature notes the cross-linguistic robustness of a diachronic pattern involving the aspectual categories resultative, perfect, and perfective. Resultative aspect markers often develop into perfect markers, which then end up as perfect plus perfective markers. We introduce supporting data from the history of Old and Middle Indo-Aryan languages, whose instantiation of th...
متن کاملDialects in the Indo-Aryan landscape
The Indo-Aryan language family currently occupies a significant region of the Indian subcontinent, its member languages being spoken in the bulk of North India, as well as in Pakistan, Bangladesh, Nepal, Sri Lanka, and the Maldives. The historical depth of the textual record and the geographical breadth of the Indo-Aryan linguistic area, the diversity of its languages (226 in all), and its many...
متن کاملThe Relationship between Case Marking and S, A, and O in Spoken Sinhala
1. INTRODUCTION. In this paper I examine the relationship between case marking and S, A, and O in spoken Sinhala. I will demonstrate that case roles are not assigned on the basis of grammatical relations, but rather they depend on a series of semantic and lexical principles including volitivity, animacy, semantic roles, and definiteness. This paper will furthermore provide evidence for S, A, an...
متن کاملWhy Indo-Aryan languages adapt English alveolars as reʈroflexes: Acoustic evidence from Punjabi
In Indo-Aryan languages, English loanwords containing the alveolar /t/ are always adapted as retroflex /ʈ/ [1]. It is argued that English alveolars share the cues of release burst with the retroflexes in Indo-Aryan languages [2]. However, no quantitative acoustic evidence is provided by [2] as to what acoustic cues of English alveolars are important for the speakers of Indo-Aryan languages to a...
متن کامل